Nonlinear Stochastic Control and Information Theoretic Dualities: Connections, Interdependencies and Thermodynamic Interpretations
نویسنده
چکیده
In this paper, we present connections between recent developments on the linearly-solvable stochastic optimal control framework with early work in control theory based on the fundamental dualities between free energy and relative entropy. We extend these connections to nonlinear stochastic systems with non-affine controls by using the generalized version of the Feynman–Kac lemma. We present alternative formulations of the linearly-solvable stochastic optimal control framework and discuss information theoretic and thermodynamic interpretations. On the algorithmic side, we present iterative stochastic optimal control algorithms and applications to nonlinear stochastic systems. We conclude with an overview of the frameworks presented and discuss limitations, differences and future directions.
منابع مشابه
From information theoretic dualities to Path Integral and Kullback Leibler control: Continuous and Discrete Time formulations
This paper presents a unified view of stochastic optimal control theory as developed within the machine learning and control theory communities. In particular we show the mathematical connection between recent work on Path Integral (PI) and Kullback Leibler (KL) divergence stochastic optimal control theory with earlier work on risk sensitivity and the fundamental dualities between free energy a...
متن کاملOptimal State Estimation for Stochastic Systems: An Information Theoretic Approach
In this paper, we examine the problem of optimal state estimation or filtering in stochastic systems using an approach based on information theoretic measures. In this setting, the traditional minimum mean-square measure is compared with information theoretic measures, Kalman filtering theory is reexamined, and some new interpretations are offered. We show that for a linear Gaussian system, the...
متن کاملA Variational Approach to Nonlinear Estimation
We consider estimation problems, in which the estimand, X, and observation, Y , take values in measurable spaces. Regular conditional versions of the forward and inverse Bayes formula are shown to have dual variational characterisations involving the minimisation of an apparent information, and the maximisation of a compatible information. These both have natural information theoretic interpret...
متن کاملNonlinear inelastic dynamic analysis of space steel frames with semi-rigid connections in urban buildings
Applied studies addressing semi-rigid connections have been limited. Scant information exists in regulations except little brief information. Therefore, this research analyzes the behavior of three-dimensional steel frames and semi-rigid connections based on beam-column method and non-linear dynamic analysis. Stability functions and geometric stiffness matrix were used to study the non-linear g...
متن کاملCoordinating a decentralized supply chain with a stochastic demand using quantity flexibility contract: a game-theoretic approach
Supply chain includes two or more parties linked by flow of goods, information, and funds. In a decentralized system, supply chain members make decision regardless of their decision's effects on the performance of the other members and the entire supply chain. This is the key issue in supply chain management, that the mechanism should be developed in which different objectives should be align...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Entropy
دوره 17 شماره
صفحات -
تاریخ انتشار 2015